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APPARATUS AND METHOD FOR 
CLOCK DOMAIN CROSSING WITH INTEGRATED DECODE 

Field of the Invention 

5 The present invention relates to the field of digital circuits. More particularly, 

this invention relates to digital circuits in which a plurality of signals that need to be at 
least partially decoded are transferred from a first timing domain to a second timing 
domain. 

10 Background of the Invention 

Certain digital circuits employ a first-in first-out (FIFO) synchronizer to 
decouple transmitter from receiver timing. For example, certain dynamic random 
access memory (DRAM) devices use a FIFO synchronizer to transfer command/address 
signals timed to a command capture clock to the timing domain of an array/core logic 

15 clock. An example of a FIFO synchronizer circuit is described by William J. Dally and 
John W. Poulton in "Digital Systems Engineering", Cambridge University Press, 
Cambridge, U.K. 1998. 

One impact of a FIFO synchronizer is that there is a latency associated with the 
transfer of the command/address signals from the sending clock domain to the receiving 

20 clock domain. The length of this transfer latency is dependent upon the clock 

frequency, and the timing relationship between the two clock signals. The frequency of 
the sending clock signal is typically the same as that of the receiving clock signal. 
However, there is no predetermined phase relationship between the sending and the 
receiving clock signals. 

25 In digital circuits that employ a FIFO synchronizer to transfer command/address 

signals between clock domains, these command/address signals are transferred directly, 
with only the timing of these signals being modified. Thus, the latency that is 
associated with the transfer of these signals is not employed for any useful purpose. In 
other words, the transfer latency represents wasted time. Thus, it would be 
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advantageous to provide an apparatus and a method for transferring signals between 
timing domains that makes the latency associated with performing the transfer available 
for performing a useful purpose. 



5 Summary of the Invention 

In accordance with one aspect of the present invention, a FIFO synchronizer 
with integrated decode is provided for transferring signals such as command and/or 
address signals between timing domains. The FIFO synchronizer with integrated 
decode partially or fully decodes the signals during the transfer, instead of directly 

10 transferring the signals. The FIFO synchronizer with integrated decode advantageously 
uses the transfer latency for the purpose of decoding the signals being transferred. In 
one embodiment, a dynamic random access memory (DRAM) device includes a FIFO 
synchronizer with integrated decode for the purpose of transferring signals from a 
sending to a receiving clock domain. 

1 5 In accordance with one aspect of the invention, an apparatus for transferring 

signals between timing domains includes a receiver for receiving signals operative in a 
first timing domain, a decoder coupled to the receiver for at least partially decoding the 
signals to generate at least one decoded signal, and an output timing register coupled to 
the decoder for outputting the at least one decoded signal in a second timing domain. 

20 In accordance with another aspect of the invention, a method of transferring 

signals between timing domains of a digital circuit includes receiving signals operative 
in a first timing domain, at least partially decoding the signals to generate at least one 
decoded signal, and outputting the at least one decoded signal in a second timing 
domain. 

25 These and various other features as well as advantages which characterize the 

present invention will be apparent to a person of ordinary skill in the art upon reading 
the following detailed description and reviewing the associated drawings. 
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Brief Description of the Drawings 

FIG. 1 is a schematic block diagram of an exemplary digital circuit in which 
signals are at least partially decoded while being transferred from a first timing domain 
to a second timing domain, in accordance with one embodiment of the present 
5 invention; 

FIG. 2 is a schematic diagram of the first-in first-out (FIFO) synchronizer with 
integrated decode shown in FIG. 1, in accordance with one embodiment of the 
invention; 

FIG. 3 is a schematic diagram of one embodiment of the command receiver of 
10 the FIFO synchronizer with integrated decode shown in FIG. 2; 

FIG. 4 is a schematic diagram illustrating each enabled register shown in FIG. 3; 

FIG. 5 is a timing diagram illustrating exemplary operation of the FIFO 
synchronizer with integrated decode shown in FIG. 2; 

FIG. 6 is a schematic diagram illustrating another embodiment of the command 
15 decoder of FIG. 1, wherein the signals transferred from the first to the second timing 
domain include an enable signal (e.g., a chip select signal) for the digital circuit; and 

FIG. 7 is a schematic block diagram illustrating a processing system including a 
dynamic random access memory (DRAM) such as the DRAM that is shown in FIG. 1 . 

20 Description of the Preferred Embodiments 

In the following detailed description, reference is made to the accompanying 
drawings, which form a part hereof, and in which is shown, by way of illustration, 
specific embodiments in which the present invention may be practiced. In the 
drawings, like numerals refer to like components throughout the views. These 
25 embodiments are described in sufficient detail to enable those of ordinary skill in the art 
to practice the present invention, and it is to be understood that these embodiments may 
be combined, or that other embodiments may be utilized and that structural, logical and 
electrical changes may be made without departing from the spirit and the scope of the 
present invention. The following detailed description is, therefore, not to be taken in a 
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limiting sense, and the scope of the invention is defined by the appended claims and 
their equivalents. 

Referring to FIG. 1, an exemplary digital circuit 100 in which signals are at least 
partially decoded while being transferred from a first timing domain to a second timing 
5 domain in accordance with one embodiment of the present invention is illustrated. In 
this example, digital circuit 100 includes a dynamic random access memory (DRAM) 
device. However, circuitry for at least partially decoding signals as those signals are 
transferred between two timing domains may also be used in other types of digital 
circuits, such as in DRAM devices having other configurations, other types of memory 

10 devices (e.g., RAM, SRAM, synchronous FLASH memory, ROM, PROM, EPROM, 
EEPROM, etc.), logic devices (e.g., communications chips), etc. In general, circuitry 
for use in at least partially decoding signals as those signals are transferred between 
clock timing domains may be used in any digital circuit application where a group of 
signals that needs to be partially or fully decoded is transferred from a first timing 

1 5 domain to a second timing domain. 

In this example, digital circuit 100 is a DRAM integrated digital circuit 
including command capture latches 102, a first-in first-out (FIFO) synchronizer with 
integrated decode 104, peripheral logic/data path logic 106, memory arrays 108, 110, 
112 and 114, and write data capture circuits 116. Digital circuit 100 also includes clock, 

20 command and data transceiver circuits (not shown). The input signals to circuit 100 
include a plurality of command signals (XCMD) 118, and three clock signals including 
a cross clock signal (XCLK) 120, a read clock signal (RCLK) 122, and a write clock 
signal (WCLK) 124. Other input/output signals to circuit 100 include bidirectional data 
signals (not shown). 

25 In this embodiment, cross clock signal (XCLK) 120, read clock signal (RCLK) 

122 and write clock signal (WCLK) 124 all operate at the same frequency. However, it 
is assumed these clock signals have arbitrary phase relationships with each other. In 
other words, there is no predetermined phase relationship between XCLK, RCLK and 
WCLK. In other embodiments, there may a phase relationship between two or more of 

30 the clocks. 
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Command signals (XCMD) 1 1 8 are coupled as input signals to command 
capture latches 102, and cross clock signal (XCLK) 120 is coupled as a clock signal to 
command capture latches 102. In this embodiment, command signals (XCMD) are 
latched into the command capture latches 102 in response to a transition (e.g., a positive 
5 or negative edge) of cross clock signal (XCLK) 120. Thus, command signals (XCMD) 
1 18 are operative in the command capture timing domain that is defined by cross clock 
signal (XCLK) 120. 

The latched command signals 126 that are output from command capture latches 
102 are coupled as input signals to FIFO synchronizer with integrated decode 104, and 

10 cross clock signal (XCLK) 120 and read clock signal (RCLK) 122 are coupled as clock 
signals to FIFO synchronizer with integrated decode 104. As described in further detail 
below, FIFO synchronizer with integrated decode 104 partially or completely decodes 
the command signals 126 while transferring these signals from the command capture 
timing domain defined by cross clock signal (XCLK) 120 to the timing domain defined 

15 by read clock signal (RCLK) 122. The output signals from FIFO synchronizer with 
integrated decode 104, which are now at least partially decoded and in the timing 
domain defined by read clock signal (RCLK) 122, are designated in FIG. 1 by the 
reference numeral 128. 

The at least partially decoded command signals 128 are coupled as control 

20 signals to peripheral logic/data path logic 106, and read clock signal (RCLK) 122 is 
coupled as a clock signal to peripheral logic/data path logic 106. Peripheral logic/data 
path logic 106 is also coupled to data signals 130, which are output from write data 
capture circuits 1 16 in response to write clock signal (WCLK) 124. Peripheral 
logic/data path logic 106 uses at least partially decoded command signals 128 and data 

25 signals 130 to control accesses to memory arrays 108-1 14. These accesses will depend 
on the configuration of digital circuit 100 and may include, for example, read accesses, 
write accesses, refresh accesses and other types of accesses of memory arrays 108-1 14. 
Peripheral logic/datapath logic 106 uses read clock signal (RCLK) 122 to perform the 
accesses in the read clock domain. 
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In the embodiment of FIG. 1, FIFO synchronizer with integrated decode 104 
includes a command receiver 132, a command decoder 134 coupled to command 
receiver 132, and an output timing register 136 coupled to command decoder 134. The 
latched command signals 126 are coupled as input signals to command receiver 132, 
5 and cross clock signal (XCLK) 120 and read clock signal (RCLK) 122 are coupled as 
clock signals to command receiver 132. As described in detail below, command 
receiver 132 receives the latched command signals 126, which are operative in the 
command capture timing domain defined by cross clock signal (XCLK) 120, and 
partially transfers or synchronizes signals 126 to the read clock domain defined by the 

10 read clock signal (RCLK) 122. The output signals from command receiver 132 are 
designated by the reference numeral 138. 

Output signals 138 from command receiver 132 are coupled as input signals to 
command decoder 134, which partially or fully decodes signals 138 to generate at least 
partially decoded command signals 140. The decoding that is performed by command 

15 decoder 134, which is described further below, depends on the requirements of 
peripheral logic/data path logic 106, and the configuration of circuit 100. At least 
partially decoded command signals 140 are then coupled as input signals to output 
timing register 136, and read clock signal (RCLK) 122 is coupled as a clock signal to 
output timing register 136. As is described below, output timing register 136 

20 synchronizes at least partially decoded command signals 140 to the read timing domain 
of read clock signal (RCLK) 122. 

In the embodiment of FIG. 1, FIFO synchronizer with integrated decode 104 
includes command receiver 132 and command decoder 134 for receiving and partially 
or fully decoding the command signals that are provided by XCMD signals 118. In 

25 another embodiment, XCMD signals 118 include only address signals that are partially 
or fully decoded while being transferred between timing domains. In yet another 
embodiment, XCMD signals 1 18 include a combination of both command and address 
signals that are partially or fully decoded, alone or together, while being transferred 
between two timing domains. In still other embodiments, XCMD signals 118 also 

30 include one or more other types of signals that are transferred between two timing 
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domains and are partially or fully decoded. XCMD signals 118 may include a 
combination of these different signal types. For example, XCMD signals 118 may 
include a combination of command, address and data signals. For convenience, this 
specification assumes that XCMD signals 118 include only command signals, unless 
5 stated otherwise. It should be understood, however, that signals 118 can include any 
combination of command, address, data and other signals, with appropriate 
modifications made to the components of circuit 100 shown in FIG. 1. 

Referring to FIG. 2, FIFO synchronizer with integrated decode 104 in 
accordance with one embodiment of the invention is shown in further detail. In this 

10 embodiment, FIFO synchronizer with integrated decode 104 is particularly appropriate 
for use in high-speed DRAM integrated circuit devices operating at fast clock 
frequencies where the use of differential clock signals is advantageous. For example, 
this embodiment is suitable for high-speed DRAM devices operating at clock 
frequencies of 400 MHz, 500 MHz or even higher. It should be understood, however, 

1 5 that this embodiment can also be used for DRAM devices which operate at other clock 
speeds, either higher or lower than these clock frequencies, and can also be used for 
other types of memory or logic devices. 

In this embodiment, command receiver 132 receives input signals including a 
reset signal (RESET J 142, a pair of differential read clock signals (RCLK and RCLKJ 

20 122, a pair of differential cross clock signals (XCLK and XCLK_J 120, and four latched 
command signals (XCMD<0:3>) 126. Note that an underscore after a signal name 
indicates that the signal is active low, and "<0:n>" after a signal name indicates the 
width of the signal. Thus, for example, RESET_ indicates the reset signal is active low, 
while XCMD<0:3> indicates that there are four command bits being input. Also note 

25 that the command signals input by XCMD<0:3> can correspond to any four command 
signals, with the "<0:3>" not implying order, as long as the <0:3> notation is applied 
consistently across the circuits. For example, XCMD<0> corresponds with SCMD<0> 
and D<0>. 

In this embodiment, XCLK and XCLK_ represent a cross clock and an inverted 
30 cross clock, and RCLK and RCLK_ represent a read clock and an inverted read clock. 
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Each pair of these clock signals thus forms a pair of differential clock signals. In digital 
circuits which use differential clock signals, data or signals are captured at the time that 
the differential clocks cross. It is noted that differential clock signals advantageously 
reduce clock timing problems in high-speed digital circuits. For example, differential 
5 clock signals result in less clock noise, and in an improved duty cycle for the clock. For 
these reasons, differential clock signals such as those shown in FIG. 2 are used for high- 
speed parts such as the high-speed DRAM that is shown in FIG. 1. However, it should 
be understood that the present system can be used for transferring signals between any 
two timing domains that can be defined by any pair of two separate clock signals, 
10 regardless of whether those clock signals are differential clock signals or single-ended 
clock signals. 

Command receiver 132 receives latched command signals (XCMD<0:3>), 
which are operative in the command capture timing domain defined by differential cross 
clock signals (XCLK and XCLKJ) 120, and partially transfers or synchronizes signals 

15 126 to the read clock domain defined by differential read clock signals (RCLK and 
RCLKJ in a manner which will be described in further detail below in reference to 
FIG. 3. The output signals from command receiver 132 are designated in FIG. 2 by 
SCMD<0:3> signals 138. Signals 138, however, are not yet fully synchronized to the 
RCLK timing domain because they have not been processed by output timing register 

20 136, which is clocked by RCLK. 

Command decoder 134 receives SCMD<0:3> signals 138 from command 
receiver 132, and decodes signals 138 to generate at least partially decoded command 
signals 140. In the embodiment of FIG. 2, command decoder 134 is a four-bit decoder 
that, for each input (SCMD<0:3>) 138, asserts a logic "1" at one and only one output 

25 (Q<0;15>) 140. Thus, for example, if SCMD<0:3> 138 equals "3" (i.e., abinary 0011), 
command decoder 134 generates a logic "1" at Q<3> and generates a logic "0" at the 
other 15 Q<n> outputs 140. In other embodiments, command decoder 134 is ann-bit 
decoder that generates 2 n outputs, where n equals 1, 2, etc. As noted above, the 
decoding performed by command decoder 134 depends on the requirements of 

30 peripheral logic/data path logic 106 (FIG. 1), and on the configuration of digital circuit 



Attorney Docket No. 303.740US1 



8 



Client Ref. No. 00-1081 



100. Thus, in other embodiments, command decoder 134 will decode the SCMD<0:3> 
signals using other logic, as described below. 

Output timing register 136 receives input signals including the at least partially 
decoded command signals 140 from command decoder 134, the differential read clock 
5 signals (RCLK and RCLKJ 122, and reset signal (RESETJ 142. Using these signals, 
output timing register 136 synchronizes the at least partially decoded command signals 
140 to the timing domain of read clock signals (RCLK and RCLK J) 122 to generate at 
least partially decoded command signals (CMDDEC) 128. Register 136 includes, in 
one embodiment, a 16-bit register that uses read clock signals (RCLK and RCLK_J 122 
10 as a differential clock signal. In other embodiments, register 136 is an n-bit register, 
with n being the number of output signals from decoder 134. Note that, in generating 
CMDDEC signals 128, command signals (XCMD<0:3>) have been at least partially 
decoded while being transferred from the command capture timing domain to the read 
timing domain. 

15 FIFO synchronizer with integrated decode 104 thus includes a FIFO 

synchronizer which has a front end including command receiver 132, and a back end 
including output timing register 136. The front end captures or receives signals 
(XCMD<0:3>) operative in a first timing domain defined by differential cross clock 
signals XCLK and XCLK_. The back end synchronizes at least one decoded signal 

20 (decoded command signals 140) to a second timing domain defined by differential read 
clock signals RCLK and RCLK_. Command decoder 134 is coupled between the front 
and back ends of the synchronizer, and is used for decoding the captured signals 
(SCMD<0:3>) to generate the at least one decoded signal (decoded command signals 
140). The FIFO synchronizer with integrated decode 104 thus transfers the 

25 XCMD<0:3> signals from the first to the second timing domain while decoding these 
XCMD signals to generate the CMDDEC<0:15> signals. 

Referring to FIG. 3, command receiver 132 is shown in further detail according 
to one embodiment of the invention. In this embodiment, command receiver 132 
includes multiple-bit enabled registers 144 0 , 144 l3 144 2 and 144 3 , multiple-input 

30 multiplexers 146 0 , 146 l9 146 2 and 146 3 , a first ring counter (i.e., "ring counter 0") 148, a 
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second ring counter (i.e., "ring counter 1") 150, and a power-on reset (POR) circuit 152 
which includes a first flip-flop 154, a second flip-flop 156, a third flip-flop 158, and a 
Muller C gate 160. Note that a Muller C gate has A and B inputs, and an output C. If A 
and B are both high, then C will be high. If only one of the inputs (e.g., A) then goes 
5 low, C will remain high (i.e., the high is stored). If the other input (e.g., B) then also 
goes low, C will then go low. At this point, both A and B are low, and output C is low. 
Then, if only one of the inputs (e.g., A) goes high, C will remain low (i.e., the low is 
stored). If the other input (e.g., B) then also goes high, C will then go high. Thus, 
Muller C gate 160 stores the previous state until both inputs are in the same state, at 

10 which point the output is changed to that state. If both inputs are in different states, then 
the output will remain in its last state. 

In the embodiment of FIG. 3, command receiver 132 includes one enabled 
register 144 n and one corresponding multiplexer 146 n for each of the four bits of 
command signals (XCMD) 126. Thus, command receiver 132 includes four enabled 

15 registers 144 0 , 144 1? 144 2 , 144 3 and four corresponding multiplexers 146 0 , 146 i5 146 2 , 
146 3 . However, in other embodiments, there may be fewer than or greater than four 
command bits. For example, there may be 2, 3, 5, 6, etc. command bits. In general, if 
there are n command bits, then command receiver 132 will include n enabled registers 
and n corresponding multiplexers. 

20 Each of multiple-bit enabled registers 144 n receives input signals including the 

nth bit of command signals (XCMD<0:3>) 126, a multiple-bit enable signal (EN<0:2>) 
162, and differential cross clock signals (XCLK and XCLKJ) 120. From these signals, 
each enabled register 144 n captures one bit of command signals (XCMD) 126 in the 
command capture timing domain, and generates a multiple-bit captured command signal 

25 for the nth command bit (CMDn<0:2>) 164 n which widens or stretches out the amount 
of time that the respective command bit is available from the point in time at which it 
was captured. In one embodiment, the enabled registers stretch out the amount of time 
the command bits are available from one cycle of the command capture clock to three 
cycles. In other embodiments, the command bits are stretched out for two, four or even 
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more cycles. The manner in which each command bit is stretched out is described in 
reference to FIG. 4. 

Referring to FIG. 4, in one embodiment, an enabled register 200 implementing 
each of enabled registers 144 n (FIG. 3) includes three flip-flops 202 0 , 202j and 202 2 , and 
5 corresponding multiplexers 204 0? 204j and 204 2 . The input signals to enabled register 
200 include one bit of the command signals (XCMD<n>) 126 n , three-bit enable signals 
(EN<0:2>) 162, and cross clock signals (XCLK and XCLKJ 120. The output signals 
from enabled register 200 are the three-bit captured command signal (CMDn<0:2>) 
164 n , In an embodiment where the command bits are stretched out for only two clock 

10 cycles, each enabled register would include only two flip-flops 202 0 and 202 { and 
corresponding multiplexers 204 0 and 204 1? the enable signals would include only two 
bits (EN<0:1>), and the output signals would be only a two-bit captured command 
signal (CMDnO: 1>). Similar variations can be made to stretch out the command bits 
for other numbers of clock cycles, as would be apparent from this description to a 

1 5 person of skill in the art. 

In each enabled register 200, XCMD<n> signal 126 n is coupled to the "1" input 
of each of multiplexers 204 0 , 204 1? 204 2 , the output from each multiplexer 204 0 , 204 lf 
204 2 is coupled to the data input of the corresponding flip-flop 202 0 , 202 1? 202 2 , and the 
output of each flip-flop 202 0? 202 19 202 2 is coupled to the "0" input of corresponding 

20 multiplexer 204 0 , 204 l9 204 2 . One bit from enable signal (EN<0:2>) 162 is coupled to 
the select input of each multiplexer 204 0 , 204j 204 2 , and each flip-flop 202 0? 202 1? 202 2 
is clocked by the cross clock signals (with XCLK_ and XCLK coupled to the C and C_ 
inputs of the flip-flops, respectively). As shown in FIG. 3, EN<0:2> is driven by ring 
counter 148, which is clocked by XCLK and XCLK_. Thus, EN<0:2> successively 

25 asserts each of its bits. 

When bit 0 of EN<0:2> goes high, the "1" input of multiplexer 204 0 will be 
selected, and XCMD<n> will be clocked into flip-flop 202 0 by XCLK_ and XCLK. At 
the same time, the "0" input of multiplexers 204 : and 204 2 will be selected, and flip- 
flops 202! and 202 2 will hold their past values. On the next cycle of XCLK and 

30 XCLK_, bit 1 of EN<0:2> will go high, which will select the "1" input of multiplexer 
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204 1? and the "0" input of multiplexers 204 0 and 204 2 . Thus, XCMD<n> will now be 
clocked into flip-flop 202 1? and flip-flops 202 0 and 202 2 will hold their past values. On 
the next cycle of XCLK and XCLK_, bit 2 of EN<0:2> will go high, which will select 
the "1" input of multiplexer 204 2 , and the "0" input of multiplexers 204 0 and 204 x . 
5 Thus, XCMD<n> will be clocked into flip-flop 202 2 , and flip-flops 202 0 and 202j will 
hold their past values. Since ring counter 148 is a three-bit ring counter, this cycle will 
then repeat itself Thus, each of the enabled registers 200 operates to stretch out the 
amount of time the respective command bit is available from one cycle of the command 
capture clock to three cycles. In other embodiments, the length of ring counter 148 

10 corresponds to the number of clock cycles the command signals are stretched out. For 
example, if the command bits are stretched out to only two clock cycles, then ring 
counter 148 would be only a two-bit ring counter. 

Referring back to FIG. 3, each of captured command signals CMDn<0:2> is 
input to a corresponding one of multiplexers 146 n . Each multiplexer 146 n is a 3 x 1 

1 5 multiplexer that selects one bit of the respective multiple-bit captured command signal 
CMDn<0:2> in response to a three-bit select signal (SELECT<0:2>) 164. The output 
signals from the multiplexers 146 0 , 146 1? 146 2 , 146 3 are combined to form SCMD<0:3> 
signals 138 that, as illustrated in FIG. 2, are output to command decoder 134 for partial 
or full decoding. 

20 Select signals (SELECT<0:2>) 164 are generated by ring counter 150, which is 

clocked by read clock signals RCLK_ and RCLK. While this use of read clock signals 
RCLK_ and RCLK causes SCMD<0;3> to be partially synchronized to the read clock 
domain, the SCMD<0:3> signals are not fully synchronized to the read clock domain 
until after being processed by output timing register 136 (FIG. 2). The operation of ring 

25 counter 150 is coordinated with the operation of ring counter 148 so that ring counter 
150 selects a particular bit of the four CMDn<0:2> signals after ring counter 148 has 
selected that bit, and before ring counter 148 selects that bit again. This coordination of 
the ring counters is due to the operation of POR circuit 152, which is described in detail 
below. 



Attorney Docket No. 303.740US1 



12 



Client Ref. No. 00-1081 



In the embodiment of FIG. 3, ring counter 148 is driven by the positive edges of 
the cross clock (i.e., XCLK drives input C of ring counter 148), while ring counter 150 
is driven by the negative edges of the read clock (i.e., RCLK_ drives input C of ring 
counter 150). Thus, ring counter 148 changes the enable signals (EN<0:2>) 162 off the 
5 positive edges of XCLK, while ring counter 150 changes the select signals 

(SELECT<0;2>) 164 off the negative edges of RCLK (as shown by the timing diagram 
of FIG. 5), Driving the ring counters 148 and 150 in this manner advantageously 
improves the timing margin and setup time for command data to pass through 
multiplexers 146 n and over to output timing register 136, which captures the decoded 

10 command data off the positive edges of RCLK. This increase in timing margin helps to 
increase the reliability of circuit device 100. In other embodiments, these circuits may 
be driven by other transitions of the clock signals. 

The EN<0:2> signals are thus used in capturing the command bits in the enabled 
registers, and the SELECT<0:2> signals corresponding to the previous EN<0:2> signals 

15 are used to open up the multiplexers 146 n to transfer the command out on SCMD<0:3>. 
The SCMD<0:3> signals are similar to the XCMD<0:3> signals, except that they will 
be synchronized to the SELECT<0:2> signals driven by RCLK. Note that the 
SCMD<0:3> signals are not yet fully synchronized to RCLK since they have not yet 
been latched into a register driven by RCLK. Full synchronization of SCMD<0:3> to 

20 the RCLK domain is performed by output timing register 136, which is driven by the 
positive edge of RCLK. Also note that driving ring counter 150 on the negative edge of 
RCLK, while driving output timing register 136 on the positive edge of RCLK, reduces 
the latency of the signals passing through digital circuit 100. Greater margin is 
available by clocking both ring counter 150 and output timing register 136 on the same 

25 clock edge at the expense of longer latency through digital circuit 100. The outputs 
from output timing register 136 are now in the RCLK timing domain, and can then be 
used throughout the DRAM's internal circuits as control signals. 

In an embodiment where the command bits are stretched out for only two clock 
cycles, each multiplexer 146 n would be only a 2 x 1 multiplexer having two data input 

30 signals (CMDnO: 1>) and a two-bit select input signal (SELECTO: 1>), and ring 
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counter 150 would be a two-bit ring counter for generating that two-bit select signal 
Similar variations can be made for other embodiments where the command bits are 
stretched out for other numbers of clock cycles, as would be apparent to a person of skill 
in the art. 

5 Note that, in the embodiment of FIGs. 3 and 4 where commands are stretched 

out or held for three clock (i.e., XCLK) cycles, the clock is allowed to vary in both 
directions. In contrast, in an embodiment where the commands are only stretched out or 
held for two clock cycles, the clock would only be allowed to vary in one direction. 
Thus, stretching out the command signals for three clock cycles may be preferable since 

10 it provides three clock cycles of variation in which to transfer command signals from 
the XCLK timing domain to the RCLK timing domain, and allows the relative clock 
phase to shift to either the negative or positive directions from a zero mid-point. In 
other words, by holding the command for three clock cycles, both negative and positive 
shifts can be reliably handled. 

15 The purpose of POR circuit 152 is to insure that, at some point in time after all 

the bits of a command (e.g., XCMD<0:3>) are captured in one of the three flip-flops of 
all of the enabled registers 144 0 , 144 l5 144 2 , 144 3 , but before a new command is 
captured in that same flip-flop of all of the enabled registers three clock cycles later, the 
corresponding select input to the 3 x 1 multiplexers 146 0 , 146 1? 146 2 , 146 3 is selected in 

20 order to allow that captured command to pass through the 3 x 1 multiplexers. For 
example, assume ring counter 148 sets bit 0 of EN<0:2> to 1 so that flip-flops 202 0 of 
all four enabled registers 144 0 , 144 l9 144 2 , 144 3 capture a four-bit command that is timed 
to the capture clock. This captured command will be available as bit 0 of the outputs 
from the four enabled registers for three cycles of the capture clock before flip-flops 

25 202 0 of the four enabled registers are used to capture another command. POR circuit 
152 insures that multiplexers 146 0 , 146 1? 146 2 , 146 3 are opened up to pass the captured 
command stored in flip-flops 202 0 at some point in time after EN<0> was set high to 
capture that command in flip-flops 202 0 , but before EN<0> becomes set high again to 
capture a new command in those flip-flops 202 0 . 
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The input signals to POR circuit 152 include RESET_ signal 142, cross clock 
signals (XCLK and XCLK J 120, read clock signals (RCLK and RCLKJ 122, and bit 
1 of enable signals (EN<1>) 162. The output signals from POR circuit 152 include a 
reset signal (RINGCNTRORJ) 166 for ring counter 148, and a reset signal 
5 (RINGCNTR1RJ 1 68 for ring counter 150. During a power-on reset, RESET_ signal 
142 is low, and flip-flops 154, 156 and 158 (and ring counters 148 and 150) are reset. 
When RESET_ goes high, RESET_ is synchronized by XCLK_ and XCLK signals 120 
using flip-flop 154 to generate a delayed reset signal 170. Delayed reset signal 170 is 
synchronized by XCLK_ and XCLK signals 120 using flip-flop 156 to generate the 

10 reset signal (RINGCNTRORJ) 166 for ring counter 148, such that ring counter 148 can 
start to run responsive to XCLK and XCLK_ after RINGCNTR0R_ goes high. Delayed 
reset signal 170 is synchronized by RCLK and RCLK_ signals 122 using flip-flop 158 
to generate a second delayed reset signal 172. Second delayed reset signal 172 and 
EN<1> are the inputs to Muller C gate 160, which generates reset signal 

15 (RINGCNTR1R_) 168 for ring counter 150, such that ring counter 150 can start to run 
responsive to RCLK_ and RCLK after RINGCNTR1R_ goes high. Initially, when 
RESET_ is low, both inputs to Muller C gate 160 are low, and the output 168 from 
Muller C gate 160 keeps ring counter 150 reset. Then, after RESET_ goes high, the 
EN<1> input to Muller C gate 160 insures that the output 168 from Muller C gate 160 

20 keeps ring counter 150 reset until EN<0> has been high and gone low. The output from 
Muller C gate 160 does not allow ring counter 150 to start to run in response to the 
RCLK_ and RCLK signals until EN<1> and signal 172 are both set. Thus, ring counter 
150 will not begin to run until after ring counter 148 has already started to run. Once 
the output 168 from Muller C gate 160 is set (which allows ring counter 150 to start to 

25 run), that output signal remains set since signal 172 will remain set. Therefore, after 
ring counters 148 and 150 have been properly coordinated following a power-on reset 
by POR circuit 152, ring counters 148 and 150 will remain properly coordinated (since 
RESET_ will remain high). Ring counter 150 will now select a particular bit after ring 
counter 148 selects that bit, and before ring counter 148 selects that particular bit again. 
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Referring to FIG. 5, operation of FIFO synchronizer with integrated decode 104 
is illustrated by an exemplary timing diagram. As shown in the top two traces, the 
XCLK and RCLK clock signals have the same frequency, but are not in phase with each 
other. As noted above, the phase relationship between XCLK and RCLK is assumed 

5 arbitrary. The EN_2_, EN_1_ and EN_0_ traces show the states of enable signals 
(EN<0:2>) 162, which are output from ring counter 148 and clocked by XCLK and 
XCLK_ to repeat the three-bit ring counter sequence of 001 2? 010 2 and 100 2 . The 
SELECT_2_, SELECT_1_ and SELECT_0_ traces show the states of select signals 
(SELECT<0:2>) 164, which are output from ring counter 150 and clocked by RCLK_ 

10 and RCLK to also repeat the three-bit ring counter sequence of 001 2 , 010 2 and 100 2 . 
However, as shown by the arrows from EN<0:2> to SELECT<0:2>, each SELECT<n> 
bit goes high only after the corresponding EN<n> bit goes high, and before that same 
EN<n> bit goes high again. This relationship between EN<0:2> and SELECT<0:2> 
reflects the coordination between the first and the second ring counters 148 and 150 that 

1 5 was described above. The XCMD<3:0> trace shows the state, in hexadecimal notation, 
of exemplary command signals 126 being input to the FIFO synchronizer with 
integrated decode 104. In this example, command signals XCMD<3:0> start at FH, and 
then cycle through each of its 16 states (i.e., 0 through FH). Finally, the CMDDEC_n_ 
traces show the states of the 16 decoded output signals 128. The CMDDEC_n_ traces 

20 reflect the decoded states of XCMD<3:0> signals 126, which have also been transferred 
from the XCLK timing domain to the RCLK timing domain. 

As noted above, command decoder 134 partially or fully decodes the command 
signals to generate at least partially decoded command signals in a manner that depends 
on the requirements of peripheral logic/data path logic 106, and on the configuration of 

25 digital circuit 100. In the embodiment shown in FIG. 2, four command bits are fully 
decoded using a four-bit command decoder 134. In this case, the four command bits are 
"fully" decoded since command decoder 134 generates a unique decoded output for 
each of the 16 (i.e., 2 4 ) possible combinations of inputs. In another embodiment, the 
command bits are only "partially" decoded. For example, it may be that only five 

30 combinations of the four command bits are valid for digital circuit 100, and that the 
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other 1 1 combinations are invalid (and will not normally occur). In this case, the 
command decoder may only partially decode the four command signals to generate five 
unique decoded outputs. 

More generally, the command decoder generates one or more than one decoded 

5 output signals in response to the command signal inputs. In some embodiments, the 
command decoder includes decode logic which generates one and only one output 
signal in response to each combination of input signals. In other embodiments, the 
command decoder includes decode logic which generates more than one output signal in 
response to each combination of input signals. In cases where the FIFO synchronizer 

10 with integrated decode is used for synchronizing and decoding non-command input 
signals, such as address signals or data signals, the command decoder is replaced by a 
"decoder" which performs the decoding which is required for the particular 
configuration of the circuit. 

Referring to FIG. 6, a command decoder 300 according to another embodiment 

15 of the invention is illustrated. In this embodiment, it is assumed that the command 

signals XCMD<0:n> received by command receiver 132 includes command and enable 
signals. For example, as shown in FIG. 6, the SCMD<0:n> inputs to command decoder 
300 may include the command signals SCMD<0:n-l> 302 and an enable signal 
SCMD<n> 304. Enable signal SCMD<n> has a first state in which the digital circuit 

20 100 is enabled, and a second state in which the digital circuit is not enabled. For 

example, the enable signal may be a chip select signal for an integrated circuit device 
(e.g., a chip select signal for a DRAM). In this example, command decoder 300 
partially or fully decodes the command signals SCMD<0:n-l> to generate one or more 
than one decoded command signal 306 if enable signal SCMD<n> indicates that the 

25 circuit is enabled, and inhibits decoding of the command signals SCMD<0:n-l> if 
enable signal SCMD<n> indicates that the circuit is not enabled. If decoding is 
inhibited, output timing register 136 does not output the one or the more than one 
decoded command signals 306. In an alternative embodiment, the enable signal is 
applied as an output enable input to output timing register 136, such that register 136 

30 inhibits or disables the output of at least one decoded command signal 306. 
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Referring to FIG. 7, in accordance with another embodiment of the invention, a 
processing system 400 includes a microprocessor 402, a memory 404, and a bus 406 
that couples microprocessor 402 and memory 404 together. In this embodiment, 
memory 404 is a digital circuit in which command signals are at least partially decoded 
5 while being transferred from a first timing domain to a second timing domain as 

described above. In other embodiments, processing system 400 includes a plurality of 
memories 404, some or all of which are enabled (e.g., selected) by an enable (e.g., chip 
select) signal on bus 406. 

In one embodiment, memory 404 is a DRAM memory device like that of FIG. 1, 

10 which receives three separate clock signals from bus 406. These clock signals include a 
first clock for timing command/address signals, a second clock for timing read data, and 
a third clock for timing write data. The device also includes internal synchronous 
circuitry, denoted as peripheral logic/data path logic 106 in FIG. 1, for performing 
functions such as accessing memory arrays, transferring data to I/O circuits in order to 

1 5 drive data out of the device, transferring data to I/O circuits in order to write data into 
the device, etc. To run this synchronous circuitry, one of the clocks being input to the 
device is selected. In one embodiment, the command/address clock (i.e., XCLK) is 
used only for capturing the command/address signals into command/address input 
latches. The write clock is used only for capturing the write data into write data latches 

20 at the DQ nodes. The read clock, in this embodiment, is used both for driving the 
latches for the data being output from the device, and for driving the peripheral 
logic/data path logic 106 for accessing the arrays. In other words, the read clock also 
serves as the DRAM internal system clock. Thus, the command/address and write 
clocks are used only for capturing signals, while the read clock is used both for reading 

25 signals and for driving the synchronous DRAM logic. 

In this DRAM embodiment, then, the read clock is used to run the internal 
DRAM synchronous circuitry. After commands are captured in the command/address 
(XCLK) timing domain, the commands must be transferred consistently to the read 
clock domain. These transfers must be performed consistently despite variations in 

30 process, temperature and voltage. While the process may be the same for any one part, 
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variations in voltage and temperature could adversely affect the relationship between the 
command/address clock and the read clock if a FIFO synchronizer circuit were not 
employed, such that the transfer of the command signals could make the DRAM device 
unreliable. In particular, DRAM devices typically have a specified latency requirement 
5 that defines the maximum amount of time from receipt of a command by the DRAM 
device until the DRAM device provides output data. This latency is specified in the 
DRAM's data sheet, and must be maintained regardless of variations in process, 
temperature and voltage. For example, a particular DRAM device could have a 
maximum latency requirement of 15 nsec, which must be met under all conditions. 
10 Even if the phase relationship between the read clock and the command/address clock 
changes, this latency requirement must always be met. One purpose of FIFO 
synchronizer with integrated decode 104 is to capture the command bits in the enabled 
registers of the command receiver 132 in the command/address clock domain, and then 
transfer the command bits to the output timing register to synchronize the command 
15 with the read clock domain. The integrated command decoder 134 advantageously uses 
the latency associated with this transfer to perform useful work. 

Conclusion 

Thus, digital circuit 100 including a FIFO synchronizer with integrated decode 
104 has been described herein. FIFO synchronizer with integrated decode 104 includes 
command receiver 132 for receiving a plurality of command signals 126 operative in a 
first timing domain defined by cross clock XCLK 120, command decoder 134 coupled 
to command receiver 132 for at least partially decoding command signals 126 to 
generate at least one decoded signal 140, and output timing register 136 coupled to 
decoder 134 for outputting the at least partially decoded command signals in a second 
timing domain defined by read clock RCLK 122. In one embodiment, the transferred 
signals include command signals. In other embodiments, the transferred signals include 
command, address, data and/or other types of signals that need to be partially or fully 
decoded. 
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In one DRAM embodiment, FIFO synchronizer with integrated decode 104 is 
used for transferring command/address signals timed to a command capture clock to the 
timing domain of an array/core logic clock. The latency associated with the transfer of 
the command bus signals from the sending clock domain to the receiving clock domain, 

5 which depends on the frequency of the clocks and on the timing relationship between 
the two clocks, is advantageously used by the decoder to partially or fully decode the 
signals during the transfer. Thus, by inserting the decoder between the front and back 
ends of the FIFO synchronizer, the time that is typically wasted during the transfer 
between timing domains is used productively, rather than wasted. The resulting signals 

10 are partially or fully decoded signals, rather than signals directly transferred between 
timing domains. 

The above description and the accompanying drawings are intended to be 
illustrative of the present invention, and not restrictive. Many other embodiments will 
be apparent to those of ordinary skill in the art upon reviewing the above description. 
1 5 The scope of the invention should therefore be determined with reference to the 
appended claims, along with the full scope of equivalents to which such claims are 
entitled. 
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